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1. Introduction 

Gap scaling is found frequently in the context of scale invariance, such as equilibrium 
statistical mechanics of phase transitions [1] , growth phenomena [2] , reaction diffusion 
processes [3] or self organised criticality [4, 5]. In the present context, gap scaling [1] 
means that the nth moment of the observable s, denoted as (s"), in leading order scales 
hke ^^{1+"—^) in some large parameter t. The "gap" refers to the constant difference A 
between two exponents for n and n + 1. In a stochastic process recently introduced as 
the "totally asymmetric Oslo model" [6, 7], the moments of the area under a random 
walk along an absorbing wall was found to obey gap scaling with r = 4/3 and A = 3/2. 

If the moments scale asymptotically as power laws (possibly with logarithmic 
corrections) but the exponents do not increase linearly with n, they are said to display 
multiscahng. The presence of multiscahng is sometimes confused with the absence of 
scaling altogether and very few examples of simple dynamical processes are known, which 
can be shown by exact calculation to display multiscaling. It is therefore highly desirable 
to find a simple stochastic process, such as Brownian motion, for which multiscaling can 
be derived analytically. Motivated by a recent study of cluster aggregation [8, 9], we 
study the scaling of "sequential moments" or, more accurately, correlation functions 
of the area size between the trajectories of coalescing random walkers, illustrated in 
Figure 1. We hope that future research will make the link between our results and 
the results recently obtained by Munasinghe et al [10] on the scaling of the n-point 
correlation function of coalescing random walkers. 

Recent years have seen a considerable interest in the distribution of the area size 
under a random walker trajectory. It has been investigated in the context of "directed 
rice piles" [7, 11], as well as in the context of extreme value statistics [12, 13]. In both 
cases, the object of interest is the area between trajectories of coalescing random walkers 
with initial spacing xq, or, equivalently, the area under the trajectory of a random walker, 
which starts at time t' = at distance r{t' = 0) = away from an absorbing wall. In 
the next section, the model will be introduced in detail. 

Addressing problems of extreme value statistics, Majumdar and Comtet [13] 
constrain the ensemble to trajectories with r{t' — — Xq and, in addition, with 
r{t' = = xo at a particular "termination time" t' — t va. the limit xq — > 0. In this 
limit, they derive very elegantly the exact distribution function of the area sizes. Due 
to the lack of additional scales, and, correspondingly, due to the lack of a dimensionless 
parameter, in the case — > the scaling of all observables is readily derived from 
dimensional analysis, i.e. the exponents are immediately known and gap-scaling is a 
necessity for all existing moments. In contrast, we will be concerned with finite xq (so 
that a priori neither exponents nor type of scaling are known) and an ensemble not 
constrained by the termination time in the form r{t' = t) = xq. This problem has been 
addressed earher [11] using some of the results in [13] for a calculation of the distribution 
function of the areas in leading order of xq- 

Unfortunately, only the first moment, which is trivial, is known exactly. Very little is 
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Figure 1. A sequence of trajectories ro(i'), ri(t'), . . . starting at t' = with initial 
spacing xq and demarcating areas s\^S2,--- between them. The dashed lines indicate 
the "walls" at = and the terminating time t' = t. Whenever random walkers meet, 
they coalesce and perform the rest of the walk together. 



known about higher moments beyond leading order. We employ analytical calculations, 
based on the formalism introduced in [7] as well as extensive numerics to overcome this 
problem. In the course, we determine the exact second moment of the distribution, as 
well as the two point correlation function, which displays anti-correlations, sometimes 
interpreted as a hallmark of multiscaling [10, 14]. To our knowledge, these are the 
first non-trivial exact results for the area size distribution under a random walker with 
Xq > 0. 

At least three correlation functions are required to decide about the presence of 
multiscaling. So far, we have been unable to perform the calculation of the third 
moment analytically. We have therefore resorted to Monte Carlo techniques, which 
are, unfortunately, hampered by slow convergence due to rare events. Nevertheless, the 
numerics strongly suggests multiscaling. 

In the next section, we will introduce the model and its observables. The 
third section contains most of the analytical and numerical results, which are briefly 
summarised in the last section. 



Figure 1 shows a lattice realisation (sec below) of a sequence of random walkers 
i = 0, 1, 2, . . . with diffusion constant Dq and trajectories rj(t') which start at t' = from 
ri{t' = 0) = ixQ. Whenever two random walkers meet, they coalesce and the resulting, 
single walker continues its walk. The walkers arc indistinguishable and it is therefore 
irrelevant whether walkers actually run simultaneously or sequentially, in which latter 
case they are thought to stop as soon as they intersect the trajectory of another walker. 
This is the picture we will adopt henceforth. 



2. Model 
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For the sake of definiteness, all trajectories are considered to run from t' = to 
t' — t. If two walker coalesce, their trajectories coincide for the remaining time. The 
area Si between two walkers i — 1 and i with trajectories rj_i(t') and rj(t') respectively, 



is a random variable, the moments of which will be denoted by (s"), where () is the 
expectation value over all configurations. 

Given an individual trajectory, it is impossible to tell whether it is produced by 
merging walkers or not. Producing the trajectories sequentially, the only trajectory 
relevant to the fate of walker i is the trajectory i — 1. There is no transient (other 
than producing the very first trajectory), i.e. stationarity is reached immediately. 
That means, for example, that the expectation value (S1S2) is translation invariant, 
(S1S2) = (sjSi+i) for all i > 1. 

If the random walkers are unbiased, any sequence of areas (sj, Sj-|-i, ■ ■ ■ , Sj-i-Ti) is, 
by construction, as likely as the sequence (sj+„, . . . , Sj). This can be seen by 

mirroring the entire sequence which, in case of an unbiased walk, leaves the probabilities 
of individual trajectories unchanged. In the presence of a bias this is no longer case and 
so the joint probability of a sequence changes under inversion. Thus, in the presence of 
a bias, one expects, for example, (5^52) 7^ {^2^1), while the two averages would be equal 
in the absence of a bias. Of course, the observable itself might be inversion invariant, 
for example (S1S2) = (S2S1) regardless of a bias. 

As long as the two paths rj and ri_i have not merged, the variance of the difference 
Tj— rj„i corresponds to the variance of a random walker with twice the diffusion constant 
D = 2Dq, walking along an absorbing wall. However, while the individual trajectories 
Ti might be biased, i.e. in the presence of a drift, the difference does not have such a 
bias. It is therefore clear that whatever is said about the (single) area size distribution 
between random walkers with initial spacing xq, biased or not, equally applies to the 
(single) area size distribution under an unbiased random walker with twice the diffusion 
constant starting at distance r{t — 0) — Xq from an absorbing wall. However, the notion 
of a sequence is missing in the latter case. 

The main aim of this work is to calculate the correlation functions (S1S2), (S1S2S3), 
and (S1S2S3S4) and determine their asymptotes, that means the exponent 7„ and possibly 
the polynomial V so that 



suspecting that 7„ is not linear in n and/or that V is non-trivial. 

As it turns out, (S1S2) and indeed (sis„+i) can be derived from (s^), the calculation 
of which is detailed in the first part of the result section. The other two moments could 
only be tackled numerically, presented in the second part of the result section. 

Most asymptotes will be determined in the limit of large t. It is important to 
realise the role Xq plays in this context. The problem is fully parametrised hy t, D and 




(1) 




(2) 
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Xq. With the area having the same dimension as t\/Di, in case of vanishing xq the 
scahng of the expectation value of any product of these areas, say is fixed to 

be a corresponding power of tVoi, i.e. (siSg) oc t^/^. In fact, there cannot even be 
any corrections to this behaviour. However, if xq ^ 0, there is a dimensionless quantity 
Dt/x^, which can and, indeed, does change the exponents of the asymptotes and gives 
rise to corrections to scahng [15]. 

3. Results 

In this section, we will first discuss in detail the analytical results, starting with a 
brief review of the method which is used to calculate the second moment (s^) in closed 
form. After deriving its expansion in powers of t, some shortcomings of the perturbative 
approach of [7] are discussed. Using these results, the correlation function is 
derived, which includes the original problem (S1S2) as a special case. This part finishes 
with a discussion on the effect of anti-correlations and a brief section on higher moments. 

The second part of the result section is concerned with our numerical results. First, 
the lattice effects are estimated by a comparison of the numerical estimate for (S1S2) 
with the analytical result. After discussing some technical problems, the results for 
(S1S2S3) and (S1S2S3S4) are shown, indicating the presence of logarithmic corrections. 

3.1. Exact calculations 

When calculating correlation functions of the form {S1S2), one of the central observations 
is that they can be obtained by considering "local moments" of the form (s^) in a system 
with twice the initial spacing, | which will be discussed in detail below. We therefore 
start the section with the calculation of the second moment and its implications for 
earlier results, in particular the approximation scheme used in [7]. Knowing (s^) then 
allows us to calculate 

All analytical calculations are based on the hierarchy of differential equations 
introduced in [7], 

dti^nit, x; xo) = Ddlijjn{t, x; xq) + xniljn~i{t, x; xo) (3) 
with boundary conditions 

\imtpri{t,x;xo) = 6nfi6{x - Xo) (4) 
Mt,0;xo) =0 (5) 

Starting with n = 0, ipolxji; Xq) is the probability to find at a; a fair random walker 

along an absorbing wall with diffusion constant D = 2Dq that started at xq. For n > 0, 

il}n is the local contribution to the n-ih. moment of the area under the trajectory, see [7]. 

For n = one finds, consistently, (s°) = 1, for n > one has 

i*t 1*00 

(s")(i;a;o) = / dt' / dx'x'n'^n-i{t\x']Xo) . (6) 
Jo Jo 

X We thank Alan Bray for sharing this insight with us. 
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To ease notation, the dimensionless form of is introduced as 



,3\ " 



tljn{x,t]XQ) = — {^\ ^n{y,r) (7) 



Xq\D 

with y — x/xq and r = t/{xl/D), so that the differential equation reads 

dr'ipnir, y) = 9JV^„(t, y) + ynipn-i{r, y) (8) 

with 5{x—Xq) in boundary condition (4) replaced by Siy — l). The propagator r; |/o) 
of Equation (3) can be constructed easily, 



GiV: r; yo) = -j= - e j , (9) 

which at I/O = 1 coincides with ipQ{y,T), i.e. 

My, r) = G{y, r; 1) = ^e""^ sinh (|-) . (10) 

The formal solution is the hierarchy 

' dr' / dyWi^n-i{y\ T')G{y, r - r'; y') (11) 
-/o 

which is, as it turns out, not easily calculated in closed form. Only the first moment is 
calculated immediately 

{s){t;xo)= / dt' dx'x'ipo{t',x';xo) = xot . (12) 
Jo Jo 

It has been shown [7, 11] that the moments follow gap scahng, 

fe$rlj^ = -«^'""""c» (13) 

with r = 4/3 and A = 3/2 and C„ dimensionless. 

Using the hierarchy (11) and the definition of the moments (6), the second moment 

is 

{s'){t-xo) -^[^) I d^y^ dyyj^dr'j^ dy'y' 

1 / _ (.y~y')f _ (.y+y'f \ 1 / (y' -i^ (y'+i)' \ ,, 

X — e 4(7^ - e ——= e - e (14) 



A/4(r - r')7r V / \/4r'7r 

where T = T(t;xo) = -^tr- The integration of (14) would be fairly straight forward, 
was it not for the factors y and y' . 

There is no reason to expect convergence problems from the improper integrals, 
because the Gaussians (and even more so differences of Gaussians) effectively cut them 
off. We therefore feel confident when changing the integration order, so that the 
improper integrals are done first and in reverse order. The integration over y can be done 
immediately and produces only a factor of the form y' ^JA{t — t')tt. The integral over 
y' is slightly more complicated, as it is of the form J dy'y''^G{y' ,t'; 1), and it produces 
an exponential and an error function, all multiplied by powers of r'. Remarkably, the 
resulting integrand for the integration over r' and r is independent of r, so that the 
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integral over r' can be replaced by the same expression evaluated at r with a pre- 
factor T — T. Due to the presence of pre- factors involving various powers of r, the final 
integration over r produces many different terms: 

2 



180 V D 



'1 + 28T+132T 



+ (1 + SOT + 180T2 + 120T^)S 



1 



180 V D 



tD 

1 + 28— + 132 

Xn 



tD 



+ 1 + 30^ + 180 



Xn 



tD 



Xn 



+ 120 




(15) 



J. / tD 
3* - ( 1 + 30— 

Xn 



(16) 



\VItDj 

where £ is the error function E(u) = (2/0r) £ dxe-"^ . To our knowledge this is the 
first non-trivial moment of the area distribution under a random walker that has been 
calculated exactly. 

It is very instructive to expand the result in powers of T = T{t; Xq) — t/{x1/D): 



{xl/Df 



2^5/2 I 8 ^3/2 _ 1 ^ 1 

g^TT 6 IS^TT 

1 



150F 
1 



7^1/2 _ J_ + 1 

180 1260V7r 



J.-1/2 



_ 21-3/2 



(17) 



181440v^ 13305600v^ 
with the terms beyond leading order to be considered corrections to scaUng [15]. There 
are two terms without a factor 7r~^/^, namely —\T and — which are present in the 
same form, i.e. without any further factor, already in the exact expression (15). What 
makes these terms very different from all others is that they are not an odd-half power 
of T. In fact, to any order, all terms apart from — (1-|-30T)/180, are odd-half powers of 
T. In [7] only the odd-half powers had been anticipated, suggesting that no deviation 
from this form is possible, but that turns out to be incorrect now. 

It is instructive to understand why the systematic, perturbative expansion in [7] 
misses the two terms. It relies on a perturbative expansion of ipoi^/Jly , fxr) , (10), for 
large /x at fixed, finite y and r. The scaling in is then inherited by ipil^/JIy , fir) 
through the integral (11) and by (s^) {/it; Xo) through (6). This relation gives rise to the 
scaling of (s^) (t; Xq) by parameterising t in convenient units, t — /ito. Along the lines of 
a renormalisation group calculation [16], /i can be considered the "flow parameter" and 
to — x^/ D as the normalisation point. Using this procedure for all orders in /i produces 
the prediction of only odd-half powers of 

However, when evaluating the perturbative expansion of ipQ^^/Jiy , ^t) in powers of 
/i in the integrals producing ipi and (s^), one violates the premise that y and r are 
finite, as in the integral (11) r runs from to a finite value and the integral over y is 
to be evaluated at divergent upper bound. As a result, one finds integrals of the form 
Jq dr drV^/^"* for i = 0, 1, 2, . . .. For i = 0, 1 the integrals are convergent, producing 
terms of order T^/"^ and T^/^ respectively, however for i > 2 one [i = 2) or both (i > 2) 
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integrals diverge in the lower limit. Ignoring this divergence, i.e. only keeping the upper 
limit, produces the correct amplitudes nevertheless. We have tested the integrals for 
i = 0, 1, . . . , 5 by direct evaluation and comparison to the exact result in the form of the 
Taylor expansion (17), of course, with the terms of order and T" missing. In general, 
the integrals for the perturbative expansion give the following odd-half powers: 



xiy 1 + 30T 



D / 180 



^ 1^ (xlV^ 4^+^- (J + 1)!(-)-^- .... 

2^\d) ^ (!_,)(!_,) (2j + l)!(.-j)! • 

The terms 1/(3/2 — i) and 1/(5/2 — i) arc reminiscent of the integration 
dr dr'r'^/2~\ As long as none of the coefficients or the error term is divergent, the 
series expansion can be considered a simple Taylor series in n and therefore in particular 
the leading order terms are reliable, as in an asymptotic expansion. 

Similarly, one can use the perturbative expansion scheme to determine the 
amphtude of higher orders of (s^). We can confirm agreement with [11] of the amphtude 
of the leading order T^, expecting the next to leading order to be correctly predicted 
in the perturbative approach as well. Following the integration for (s^), now three 
integrals over r might diverge and one might speculate that the first term missed in 
the perturbative expansion is T'''/^ and the (actually divergent) amplitude for to be 
correct, a T^/^ to be missed, to be correct, T^/^ missed again, and terms of order T° 
and lower to be correct again. 

We now turn back to the expansion of the exact result. Equation (15). When it 
comes to the large distance behaviour of the correlation function, the second moment 
(s^) {t]XQ) will be evaluated for small T. Expanding Equation (15) for small T gives 

(s'>(i;xo) (19) 

^ + hr^ + e-^^ (-512T13/2 + 28672r^5/2 - O {T^"^)) 

Again, the leading, polynomial orders, which together with the pre-factor give just 
(s^) (i; xq) = x^t^ + (2/3) + . . ., are solely due to the two unusual terms with integer 
powers identified in (15). However, it is consistent with the method in [7] being a large 
T expansion, that they would have been missed there. All other terms in (19) are 
exponentially suppressed with a remarkably high leading power in the polynomial. In 
fact, the first six terms, starting with T^/^, all turn out to have vanishing amplitude. 

3.1.1. Two point correlation function Based on (s^) one can derive the two point 
correlation function by noting that 

{{s, + s,f)-{s\)-^{sl) = 2{s,s,) (20) 

and more generally with S = S2 + + . . . -\- s„_2 for n > 3 (where 5" = at n = 3) 

((si + Sf) (t; xo) - 2 ((si + S + Sn-if) {t; xo) + {{si + S + Sn-i + Snf) (t; Xo) 

= 2{s,Sn){t;xo) = 2cn-i{t;xo) (21) 
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Figure 2. Pnming the network of trajectories, such as the one shown in figure 1, by 
leaving only every nth trajectory produces an ensemble of walkers with initial spacing 
nxQ (trajectories removed are shown dotted). Pruning is identical to merging the 

enclosed areas, in this example n = 3 so that s[ = si + S2 + S3, s'2 = S4, + + sq, 

Such a joining/pruning scheme, mapping an ensemble with initial spacing xo onto an 
ensemble with initial spacing uxq is exact, even on the lattice. 

using translational invariance in the form (s^) = {s^-i} and ((si + S)sn-i) = 
{{S + s„_i)s„). Witii Un{t; Xq) = ((si + . . . + SnY) {t; Xq) and the convention U-i = Ui 
and Uq = 0, the correlation function is just the lattice Laplacian of the second moment 
Un{t;xo), 

2 (siSn+i) {t; Xo) = Un+i{t; Xo) - 2un{t; Xo) + Un-i{t; Xo) = V^M„ (22) 

for n > 0. Given Equation (15), the second moment 'U„(t; Xo) is easily calculated because 
Un{t;xo) = ui{t;nxo) = (s^) (t;nxo). This is illustrated in Figure 2: The network of 
trajectories is pruned by joining n consecutive areas Si, . . . , s„ to one large area. Thereby 
a sequence of larger areas is constructed, as if produced by random walkers with initial 
spacing nxo- The exact two point correlation function therefore is 

{siSn+i) {t; ^0) = ^ ( (s'> {t] \n - l\xo) - 2 (s^) {t; \n\xo) + (s^) {t; \n + l|a;o)) (23) 

for n e Z, with the exact expression given in (15). The modulus guarantees the validity 
even for n < 0, although the above results for (s^) (t; Xq) have been derived using Xq > 0. 

In the following, the asymptotes of (siSn+i) are examined. We first consider the 
behaviour of (siSn+i) {t;xo) in the limit of large t at fixed n. By dimensional analysis, 
(s^) {t;xo) consists of a pre- factor (xl/D)"^ and a dimensionless function in T = Dt/xl- 
According to the Taylor expansion (17), a term of order T^/^ gives rise to a contribution 
in total linear in xq, and lower order terms in T correspond to larger powers in xq. 
Under the provision of large T according to Equation (17) the powers are xq, Xq, Xq, Xq, 

Xq, Xq, Xq, Wheu taklug the lattice Laplacian according to (23), Equation (17) is 

evaluated term by term for xo — > nxo and xo — > (n± l)xo, which enters in the pre-f actor 
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lim it- xo) = {sY it- xo) = {txof = ( ^ ) T\t- x^) . (25) 



{xq/DY as well as in T(t; xq) = tD/xl- Since (n + 1) — 2n + (n — 1) = 0, the first power 
of Xq to contribute therefore is x^, so that in the limit of fixed n > and large t 

(siSn+i) (t; xo) (24) 

= ( i ] ( + + ^ ) j • 

again in terms of T = T{t;xo) — tD/xl- § As discussed below, (siSn+i) — {sY < for 
all T greater than some threshold and riy^O. 

Of course, in the limit of large n at fixed t, one expects asymptotic independence. 

This large n behaviour can be obtained from the small T expansion. Equation 
(19). The polynomial part together with the prc-factor {xq/DY contributes the 
expected constant {txQY, while T^{xq/ DY is independent of Xq and its contribution 
vanishes therefore after operating with the lattice Laplacian. The exponential pre- 
factor exp(— l/(4T(t; nxo))) = exp(— n^/(4T(t; xq))) renders the contribution from n — 1 
exponentially more relevant than from n or n + 1: 

X {-n-' - 7n"^ + 28(2r - l)n-^ + O (n-^°)) + O (e'^'^?) (26) 
in the limit of large n at fixed t. 

3.1.2. Anti- correlations The Laplacian structure (22) gives rise to rather peculiar 
behaviour of the error of the numerical estimate of (s). In a naive implementation 
one probably takes a series of N samples, Si, S2, . . . , sat and estimates (s) by 

1 ^ 

which is indeed an unbiased estimator, (s) = (s) [17], independent of A^. In the following, 
wc will use an over-bar to indicate numerical estimates, for example s for the estimate of 
(s). Considering its error, for large N one arrives at a picture as cartooned in Figure 3: 
The walkers start from ro(t' = 0) = 0, ri(0) = Xq, T2(0) = 2xo, . . . , TAr(O) = A^a^o, so that 
for sufficiently large N the very first one, tq, is extremely unlikely to interact with the 
very last one, rjq. The total area between tq and tat is exactly the sum over all areas, 

Ns^Ntxo+ j &t'[{rN{t')-NxQ)-ro{t')\ . (28) 

Assuming that the first and the last are in fact independent, suggests that the total 
area is well modelled by a Langevin type approach ro,Ar(t) — ro,Ar(t = 0) = dt'?7o,Ar(t') 

§ To avoid confusion henceforth the short hand T (without arguments) is used whenever it is to be 
evaluated at t and xq, i.e. where any n dependence has been removed, T(i; nxo) = n~^T{t; xq) = n~'^T. 
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Figure 3. Estimating the average area (s) by considering a sequence si,...,Sn is 
equivalent to considering the entire area enclosed by the first and the last trajectory, 
ro and tn (shown in bold) and dividing by the total number of walkers N. 



with two independent noise sources r]o^N{t) with vanishing mean and correlator 
(?7o,Ar(ii)?7o,Ar(^2)) = 2r^5(ii —12). This correlator has variance {rl^it)) ^ 2rH ^ 2Dot, 
so that = Do = D/2. Assuming ro and rjv are independent, the variance of the 
numerical estimate is 2Dt^/ (3A^^), which means that the error in the estimate from the 
correlated sequence Si, . . . , s^r vanishes faster than in the fully uncorrelated case, where 
the variance decays like a"^ (s) / N = 32D^/H^/'^Xo/{15Ny/7r)+. . .,witha^{s) = {s'^)-{sf. 

This surprising behaviour is explained by anti-correlations. To see this more clearly, 
we point out the "sum rule" 




2 



i=l 



N-l ^ 

J](iV - i) {t;xo) - - (s'> {t;xo) 

i=0 



(29) 



where the equality of the first and second line is reminiscent of the Laplacian property 
(22). When considering large N, comparing to (19) confirms, 

2 



and more importantly 

N 



(30) 



-Na'{s) + ^Dt' + ... 



(31) 



In other words: In the variance of the estimator (27), the anti-correlations fall short of 
canceUing the contribution of the variance of the area cr^(s) /Nhy only a small correction 
2Dt^/{3N^), see (30). Thus, in the limit of large T, the correlator — (s)^ drops 
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Figure 4. The numerical estimate of the correlation function (siSi+„) for n = 
1,2,..., 130 for xq = 2, t = 4096 and D = 1, shown in the form sisi+n/ (s)^ where 
(s) = Xot. The symbols represent numerical estimates (no error bar is shown), the 
straight line is the theoretical result (23). The second moment, n = 0, is not shown 
because it is orders of magnitude greater than (s)^. 



sharply from 

„3\ 2 



at n = 0, Equation (17), to 



{sis.+n) {t; xo) - {sf {t; Xo) = (^^) (^-T^ + ^riT'/' + . . )j (33) 

at fixed n > 0, see (24). Of course, in the hmit of large n asymptotic independence 
prevails and only exponentially decaying correlations are left 

(sisi+n) {t; Xo) - {sf {t; xq) (34) 
see (26). Figure 4 shows together with its numerical estimate. 



3.1.3. Other moments In the pursuit to determine whether the correlation functions 
(si) , (S1S2) ) (S1S2S3) , . . . display gap scaling or multiscaling in T, any third correlation 
function, other than (s) oc T, (12), or (S1S2) oc T^^^, (24), is to be determined. Along 
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the same lines as above and using translational invariance, one finds 

(S1S2S3) = \ ( {{si + S2 + s^f) - 2 ((si + S2f) - ((si + s^f) + 3 (s^) ) (35) 

= \ ( {{si + S2 + s^f) - 3 {s^) - 6 {sis2) - 6 {s^sl) - 3 (s^^g) - 3 {s^sl) ) (36) 

with the leading order of (s^) cx known from [7, 11]. In case of an unbiased random 
walk, the second line simplifies further, for example {s\s2) — Unfortunately, the 

above expression does not suffice to determine the leading order of (S1S2S3): On the right 
hand side of Equation (35) ((si + 53)^) is unknown, on the right hand side of Equation 
(36) effectively the same term is unknown, namely {s\s^) + (S1S3). The remaining terms 
can be determined using the form suggested above. 



(s3>(t;xo) = 




where only the leading fourth order has so far been proven to exist with Ci — 15/8. In 
particular, 

{sis,) + {sA) = I {{{si + s,r) - 2 (.^» = ' (2C.T3 + . . .) (38) 

and similarly for the leading order of 

6 (S1S2S3) + ((si + ssf) = ((si + S2 + ssf) - 2 {{si + S2f) + 3 (s^) (39) 
= 2Cir^ + MCsT^ + . . . , 

which is (35) with all unknown terms on the left hand side and all known terms on 
the right hand side. So, if (S1S2S3) has leading order higher or equal to ((si + 53)^), it 
must be T^, given that the leading order always has positive amplitude. If it is of lower 
order, it might not appear on the right hand side at all, since it might be cancelled 
by a negative amplitude from ((si + s^)"^). This is potentially a very useful numerical 
criterion, because if (S1S2S3) / ((si + ^3)^) docs not converge to 0, it means that (.S1.S2S3) 
has leading order T'^; from numerics, however, it seems that the ratio asymptotically 
vanishes. 

3.2. Numerics 

All numerical simulations have been done on the lattice, all analytical calculations in 
the continuum. On the lattice, in every time step the random walker takes one step 
in the up or down direction with probabilities p and q = 1 — p respectively. || The 
resulting diffusion constant is calculated from the variance after n steps, Anpq — 2Dot 
with t = nAt and units chosen so that At — 1 and D = 2Dq = 4pq. In most of our 
simulations, the walker was unbiased, p = q = 1/2, implying D — 1, and the initial 
spacing is chosen to be minimal, xq = 2.% This choice could, in principle, introduce 
discretisation effects, which are discussed below. 

II The random number generator used throughout this study is the "Mersenne Twister" [18]. 
^ Note that Xi{t') is even at even times and odd at odd times. 
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The central objective of the numerical approach was to determine the large 
T behaviour of (siS2=^3) and (S1S2S3S4). As mentioned earlier, the most naive 
implementation estimates (siS2 ■ ■ ■ Sn+i) by 

N-n 



which, however, introduces correlations. To avoid that, one can produce independent 
reahsations at a significantly higher cost: While N trajectories produce N — n correlated 
samples, they produce only N/{n + 1) uncorrelated samples, using a new set of n + 1 
trajectories for every realisation of Si, S2, . . . , s„. Obviously, a correlated sample, for 
instance SiS2; 5253, • • • , s-js^ {N = 9,n = 2), contains multiple uncorrelated sub-samples, 
for example SiS2! S4S5, sySg but also 3283,353^ etc. The correlated sample therefore 
converges at least as fast as its uncorrelated sub-samples, however, possibly including 
anti-correlations. 

It is very important to produce large samples, because of "exponentially rare but 
important events" [19]. For example, estimating (S1S2S3) we found an exceptionally 
large event where S1S2S3 was almost 10^^ times bigger than the average (S1S2S3) up to 
that point, estimated from about 3.4-10^^ sequential samples, for xq = 2 and t = 2^"^. At 
the end of the simulations, the inclusion of the extreme event still increased the average 
(•SiS2'S3) by a factor of 3.7. 

In addition to the statistical error, there arc errors from the discretisation and, 
closely related, the size oi xq. Ideally, in a numerical simulation of a continuous random 
walk, xo and t were chosen as large as possible, which is, however, computationally 
very costly. To estimate the influence of lattice effects due to small xq, we compared 
sIs^(i;xo) to the exact result (15) for different Xo — 2,4,8,32. The result is shown in 
Figure 5 in the form s^s^/ (S1S2) versus t. As expected, it suggests large Xq to avoid 
lattice effects, which clashes with the demand for large T — tD/x^ to determine large 
time asymptotes in T. Comparing the product of the square of the statistical error 
and the CPU-time spent on the results for different xq indicates that a sequential, i.e. 
correlated, sec (40), simulation of Xq = 2 is most efficient. 

For technical reasons, the error bars shown in the graphs are estimated from the 
variance of the estimator for small subsamples. For example, the independent samples 
were created from 1800 individual runs, each consisting of about 7.2 • 10^ to 61 • 10^ 
independent samples totalling to 4.1 • 10^° samples, produced by fair random walks as 
described above. However, while (S1S2) suggests satisfactory agreement between discrete 
numerical and continuous analytical results, as mentioned above, rare events made it 
very hard to estimate (S1S2S3) or higher correlation functions. The most obvious way 
to overcome this problem is to use importance sampling [20] which, however, requires 
a method to produce these rare, important events and a control over their frequency. 
We decided to introduce a repulsive potential between the walkers which therefore run 
simultaneously rather than sequentially. Having no analytical reference other than (s) 
and (S1S2), it is difficult to assess the quality of this approach. Given the strong impact 




(40) 



i=l 
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Figure 5. The ratio of the numerical estimate siS2 and the theoretical value (siS2)- 
The number of samples varies between 4 • 10^° and 1.16 • 10^^. The symbols arc shifted 
relatively to each other to reduce clutter, the error bars span two standard deviations 
in total. The dashed line marks perfect agreement between numerics and theory. 
Data marked (i) is from using independent samples, data marked with (s) comes from 
sequential, correlated runs. Equation (40). The data points marked with arrows have 
been produced by importance sampling, (r). 



of rare events on (S1S2S3), the quality of the numerical estimate J]!^ compared to (S1S2) 
has only very limited indicative power for the quality of S1S2S3. It is remarkable how 
large the error of the importance sampling scheme is for siS2 compared to the naive 
methods, see Figure 5, and how small it is for sls^, see Figure 6. Moreover, the 
numerically estimated error bar of any estimator is expected to suffer from the same 
problems as the estimator itself, i.e. too small or wrongly biased samples. 

The key result for (S1S2S3) is shown in Figure 6 in the form (S1S2S3) /[{xl/ DfT^/^], 
where we divide by (xq/ DY to render the result dimensionless and by T'^/^ to remove 
the suspected leading order. It remains somewhat inconclusive whether the fit to a 
logarithm in the intermediate region breaks down for the sequential runs at large T only 
because of a lack of statistics. The latter is, however, consistent with the observation 
that underestimation of (S1S2S3) is more than twice as frequent as overestimation even 
when averaging over about 4-10^ samples. The estimates produced from the importance 
sampling method seem to confirm the presence of the logarithm even for the largest 
t = 131072. 

From the present result, we can quite confidently rule out gap scaling, which would 
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Figure 6. The numerical estimate sTsiss in the form siS2S3/[{x'q/ D^T^^"^] in a log- 
hn plot. Symbols corresponding to different initial spacings are shifted relatively 
to each other to reduce clutter. Data marked (i) is from independent samples, (s) 
from sequential samples, (r) from importance sampling with repulsive potential. The 
straight line is a fit to a logarithm aln(r/6) to guide the eye. The importance sampling 
result (see arrows) fully agrees with the logarithmic fit in the intermediate region of 
the naive scheme. It remains unclear why for large T the results from independent 
samples deviate so significantly from the logarithmic behaviour. 



require (S1S2S3) /t^ to converge in the limit of large i to a non- vanishing value. A direct 
measure for the validity of gap scaling is a moment ratio of the form [6] 

{S1S2) 

which converges to a non- vanishing value if the observables obey gap scaling, irrespective 
of the value of the exponents. The numerical estimate of is shown in Figure 7 (using 
the analytical results for (s) and (S1S2)). Its decay to indicates once more that {siS2S^) 
does not diverge fast enough. 

Along the same lines, Figure 8 shows SiS2S^Si/[{x1/ DYT^^"^] together with a fit to 
a parabola in ln(T). While the fit is not perfect and the data is, apart from the result 
from importance sampling, slightly inconsistent for large T, the suggested parabola 
seems plausible. The data is certainly not consistent with gap scaling. 
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Figure 7. The moment ratio gs, Equation (41), would converge to a nonzero value in 
the case of gap scahng. As in previous figures, data are shifted to avoid chittcr. They 
show different initial spacings xo and different simulation methods [(i) independent, 
(s) sequential, (r) repulsive]. 



4. Summary 

Motivated by the question whether the sequence of the areas between trajectories of 
coalescing random walkers exhibits multiscaling, we have calculated the second moment 
and subsequently the two point correlation function exactly. The correlation function 
displays anti-correlations, as was discovered in the analysis of the variance of the 
numerical estimator of the first moment. Anti-correlations are sometimes regarded as 
indicating multiscaling [10, 14]. 

Three moments are necessary to expose the absence of gap scaling, which 
necessitated a numerical calculation. Due to the presence of rare but important events, 
convergence is slow, yet an importance sampling scheme produces results consistent with 
logarithmic scaling, which is often observed in conjunction with multiscaling [10, 14]. 

While the analytical proof for multiscaling in the present system is still lacking, 
the numerical results indicate it convincingly. Two possible avenues may be pursued 
in the future: An exact solution along the lines discussed in section 3.1.3, or an exact 
calculation based on the work by Munasinghe et al [10, 14]. 




Figure 8. The numerical estimate S1S2S3S4 in tlic form si .S2 ■S3.S4 / [(.Xq / D^T'^^'^] in a 
log-lin plot, similar to Figure 6. Data is shifted to avoid clutter, and shows different 
initial spacings xq and different simulation techniques. The full line is a fit to a parabola 
in the logarithm of T, i.e. a\n{T/h) + c\n{T/hf. 
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